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A critical requirement for diverse applications in Quantum Information Science is the capability 
to disseminate quantum resources over complex quantum networks 00 For example, the coherent 
distribution of entangled quantum states together with quantum memory to store these states can 
enable scalable architectures for quantum computation Q , communication Q , and metrology . As 
, a significant step toward such possibilities, here we report observations of entanglement between two 

f^**) ■ atomic ensembles located in distinct apparatuses on different tables. Quantum interference in the 

detection of a photon emitted by one of the samples projects the otherwise independent ensembles 
£SJ , into an entangled state with one joint excitation stored remotely in 10 s atoms at each site After 

! * . a programmable delay, we confirm entanglement by mapping the state of the atoms to optical fields 

and by measuring mutual coherences and photon statistics for these fields. We thereby determine a 
quantitative lower bound for the entanglement of the joint state of the ensembles. Our observations 
provide a new capability for the distribution and storage of entangled quantum states, including for 
scalable quantum communication networks gg. 

t-H ' 

Entanglement is a uniquely quantum mechanical property of the correlations among various components of a 
physical system. Initial demonstrations of entanglement were made for photon pairs from the fluorescence in atomic 
cascades 0, H| and from parametric down conversion !9j] . More recently, entanglement has been recognized as a critical 
resource for accomplishing tasks that are otherwise impossible in the classical domain [jj . Spectacular advances have 

__, \ been made in the generation of quantum entanglement for diverse physical systems M ; including entanglement 

I/"") ■ stored for many seconds in trapped ions for distances on the millimeter scale |lOt fll| . long-lived entanglement of 
macroscopic quantum spins persisting for milliseconds on the centimeter scale |l2j, and remote entanglement carried 

"*^h , by photon pairs over distances of tens of kilometers of optical fibers !13|. 

For applications in quantum information science, entanglement can be created deterministically by way of precise 
control of quantum dynamics for a physical system, or probabilistically by way of quantum interference in a suitable 
measurement with random instances of success. In the latter case, it is essential that success be heralded unambigu- 
ously so that the resulting entangled state is available for subsequent utilization. In either case, quantum memory is 
required to store the entangled states until they are required for the protocol at hand. 

There are by now several examples of entangleme nt g enerated "on demand," beginning with the realization of 
the EPR paradox for continuous quantum variables |l4| and the deterministic entanglement of the discrete internal 

V^l ' states of two trapped ions Important progress has been made towards measurement-induced entanglement on 
various fronts, including the observation of entanglement between a trapped ion and a photon pj|- 

Against this backdrop, here we report the initial observation of entanglement created probabilistically from quantum 
interference in the measurement process, with the resulting entangled state heralded unambiguously and stored in 
quantum memory for subsequent exploitation. As illustrated in Fig. 1, the detection of a photon from either of 
two atomic ensembles (L, R) in an indistinguishable fashion results in an entangled state with one "spin" excitation 
shared at a distance of 2.8 m between the ensembles and distributed symmetrically among ~ 10 5 atoms at each site 
I™. Confirmation of entanglement is achieved by mapping this stored excitation onto light fields after a 1 /is delay 
|a.lr7| and by suitable measurements of the quantum statistics of the resulting optical fields. Our results provide the 
first realization of the capability to transfer a stored entangled state of matter to an entangled state of light. 

Our experiment is motivated by the protocol of Duan, Lukin, Cirac and Zoller (DLCZ) [6j for the realization of 
scalable quantum communication networks with atomic ensembles. The DLCZ protocol introduced a number of novel 
ideas for quantum information processing and is the subject of active investigation. In this direction, nonclassical 
correlations 

dEBIIJIiilEimiHand entan S lement El have bee 

n observed between pairs of photons emitted 
by a single atomic ensemble. Observations of coherence between two cylindrical volumes of cold Rubidium atoms 
within a single magneto-optical trap have also been reported |26j |. although entanglement was not demonstrated 
between the two regions [271 u3 • 

A simple schematic of our experiment is given in Fig. 1, with further details provided in Refs. [rH l2ll |23| . For the 
writing stage of the protocol, two classical pulses traverse the L, R ensembles in parallel and generate fields 1l, lfl 
by spontaneous Raman scattering (see Fig. la). The intensity of the pulses is made sufficiently weak such that the 
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probability of creating more than one excitation in the symmetric collective mode 6] of the ensemble is very low [2l| 




FIG. 1: An overview of our experiment to entangle two atomic ensembles is shown, (a) Setup for generating entanglement 
between two pencil-shaped ensembles L and R located within spherical clouds of cold Cs atoms. The atomic level structure 
for the writing process consists in the initial ground state \g)(6Si/2, F = 4 level of atomic Cesium), the ground state |s) for 
storing a collective spin flip (6Si/2,F = 3 level), and the excited level \e) (6P 3 m,F' = 4). The transition \g) — > |e) in each 
ensemble is initially coupled by a write pulse detuned from resonance to generate the forward-scattered anti-Stokes field 1 from 
the transition |e) — > s). The L,R ensembles are excited by synchronized writing pulses obtained from beam splitter TSS W . 
After filtering, the Stokes fields 1l, 1r are collected, coupled to fiber-optic channels, and interfere at beam splitter BSi, with 
outputs directed towards two single-photon detectors Di a ,Du. (b) Schematic for verification of entanglement between the 
L, R ensembles by conversion of atomic to field excitation by way of simultaneous read pulses obtained from BSij. The read 
pulses reach the samples after a programmable delay from the write pulses, and couple the transition |s) — » |e') (|e') being the 
§P\/2,F' — 4 level), leading to the emission of the forward-scattered Stokes fields 2l,2r from the transition |e') — > \g). The 
inset (i) shows the configuration used to measure the diagonal elements ptj of p2 L ,2 R in Eq. [2] from the photo-detection events 
at D2a, D2b, F>2c- Reconfiguring the fiber connections we can easily pass from the configuration of inset (i) to the one of inset 
(ii), which is used to generate interference of the 2l,2_r fields at beam splitter BS2 to measure the off-diagonal coherence d 
in p2 L ,2 R - In (a) and (b), the incident write and read beams are orthogonally polarized and combined at a polarizing beam 
splitter, and are focused to a waist of about 30/j,m in the sample region. All beam splitters BS are polarization maintaining 
fiber beam splitters. The ~ 12m arms of both write and read interferometers are actively stabilized using an auxiliary Nd:YAG 
laser at 1.06 /im. 



Entanglement between the L, R ensembles is created by combining the output fields \l, Ir on the beam splitter 
BSi, with outputs directed to two photodetectors Di ai Du, (see Fig. la). For small excitation probability and with 
unit overlap of the fields at BS\, a detection event at D\ a or Du, arises indistinguishably from either field lr or In, 
so that the L, R ensembles are projected into an entangled state, which in the ideal case can be written as [rll2^ 



\*l,r) =e L \0) L \l) R ±e^e R \l) L \0} R , 



(1) 
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where \Q)l.r, refers to the two ensembles L,R with 0,1 collective excitations respectively, 6l(£r) is the nor- 

malized amplitude of photon generation from ensemble L(R), and the sign ± is set by whichever detector records the 
event. The phase rji = Af] w + A71, where Af3 w is the phase difference of the write beams at the L, R ensembles, and 
A71 is the phase difference acquired by the 1^, 1r fields in propagation from the ensembles to the beam splitter BS±. 
Note that to achieve entanglement as in Eq. ^ 771 has to be kept constant from trial to trial. 

To verify the entanglement, we map the dclocalizcd atomic excitation into a field state by applying simultaneously 
strong read beams at the two ensembles (see Fig.fb). If the state transfer were to succeed with unit probability, the 
conditional state \^l,r) of the ensembles would be mapped to an entangled state of two modes for the Stokes fields 
2 L ,2 R given in the ideal case by UM 



\^LR)=e L \l) 2L \0) 2R ±e 



i(Vl+V2) 



cfl|0>ajl> 



(2) 



where \0)2 L ,2 R , \1)2 L ,2 R refers to the Raman fields 2^, 2# with 0,1 photons, respectively. Here, 772 = A/3 r + A72, where 
Af3 r is the phase difference of the read beams at the L, R ensembles, and A72 is the phase difference acquired by 
the fields in propagation from the ensembles to the beam splitter BS2 in Fig. fb. In our experiment, the 

phases 771,772 can be independently controlled and are actively stabilized by utilizing auxiliary fields at 1.06 /mi that 
copropagate along the paths of the write and read beams and of the IlAr an d 2^,2^ fields. 

Of course, the states in Eqs. 1 1121 are idealizations that must be generalized to describe our actual experiment 
0, 127], ■ Specifically, the presence of various sources of noise necessarily transforms these pure states into mixed 
states. Eqs. also neglect the vacuum contribution as well as higher-order terms, which are intrinsic to DLCZ 
protocol and which otherwise arise due to diverse experimental imperfections. Moreover, the above analysis assumes 
that all excitations are in the correct "modes" (both for optical fields and for the collective atomic "spin flips"), that 
excitations of the ensembles map f-to-f to photons in fields 1 and 2, and that diverse sources of background light are 
absent. 

The procedure that we have devised to provide a robust, model independent determination of entanglement is based 
upon quantum tomography of the 2l , 2# fields (see Appendix B for details) . Since entanglement cannot be increased 
by local operations on either of the two ensembles, the entanglement for the state of the ensembles will be always 
greater than or equal to that measured for the state of the light fields. Specifically, conditioned upon a detection at 
Di a or Dib, we consider the density matrix 



1 

P2 L ,2 R = -p 



/Poo 

Poi 

d* Pw 

V p n 



(3) 



which is written in the basis \n)2 L \m)2 R1 with the number of photons {n, m} = {0, 1}, pij is then the probability to 
find i photons in mode 2L and j photons in mode 2R, and d gives the coherence between the \1)2 L 10)2^ and |0)2f, |1)2 R 
states. f>2 L ,2 R is obtained from the full density matrix P2 L ,2 R by restricting it to the subspace where there is at most 
one photon in each mode, with then P = poo + P01 +P10 + Pn- Since the concurrence C{p2 L ,2 R ) for p2 L ,2 R provides a 
lower bound for the concurrence C(p2 L ,2 R ) for p 2L ,2 R [C{p2 L ,2 R ) > PC(P2 L ,2 R )]i we devise measurements to deduce 
the various components of p~2 L ,2 R - The concurrence C{p2 L ,2 R ) can then be calculated from Eq. 01 by way of Ref. [30| . 



PC 



c(2|d|-2V(pooPii),0) 



(4) 



The entanglement of formation E follows directly from C , where E and C both range from to I for our system and 
E is a monotonically increasing function of C 



;tly I 



Probability 


D la 


D lb 


Poo 


0.98510 ± 0.00007 


0.98501 ± 0.00007 


P10 


(7.38 ±0.05) x 10 -3 


(6.19 ±0.04) x 10" 3 


Poi 


(7.51 ±0.05) x 10" 3 


(8.78 ±0.05) x 10~ 3 


pu 


(1.7 ±0.2) x 10" 5 


(1.9 ±0.2) x 10" 5 



TABLE I: Diagonal elements of the density matrix p~2 L ,2 R [Eq. (3)], deduced from the records of photo-electric counts. The 
values of py are referenced to the location of detectors D2 a ,2b,2c, and were obtained by considering unit detection efficiency, 
which gives a more conservative (i.e., smaller) lower bound for the concurrence than the actual (larger) field concurrence for 
finite efficiency < 1. See Appendix A for further details. 
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As a first step in the determination of C we measure the diagonal elements pij. As shown in Fig. lb, the field-2 
output of each ensemble is directed to different sets of detectors in order to record photon counting probabilities for 
the fields 2l,2r separately. From the record of photoelectric counting events, we then deduce the diagonal elements 
of P2 L ,2 R , which are listed in Table 1. From Eq.^Jand noting that \d\ 2 < pioPoi, a necessary requirement for C > is 
that there be a suppression of two-photon events relative to the square of the probability for single-photon events for 

f2^ (2) 

the fields 2l, 2r i.e., h} c ' = Pn/(pioPoi) < 1- For our measurements, we find h c = 0.30 ±0.04 for events conditioned 

(2) I | 

on detection at D la , and h y c = 0.35±0.04 for events conditioned on Du, [211] . In contrast, for non-conditioned events, 
we find h [ nl = 0.99 ± 0.04. 

The second step in our tomography protocol is to determine the coherence term d in Eq. |3 which we accomplish by 
adding a relative phase shift ip for the fields 2^,2^, and by combining them at the beam splitter BS2 shown in Fig. 
lb. By recording the conditional count rate after the beam splitter as function of <p, one can measure an interference 
fringe with a visibility V, with then |d| following from V and the p^. Roughly, for 50/50 beam splitters and neglecting 
higher order terms (that are employed in our actual analysis), we would have |d| = V(pio + poi)/2. 

Figure 2 shows conditional counts A^a, A^t + N% c as functions of (p. These data demonstrate that the indistin- 
guishable character of measurement events at detectors D la (Fig-2a) and Du, (Fig. 2b) induces a high degree of 
coherence between the otherwise independent ensembles L,R [a,|2g. Indeed, we deduce visibilities V\ a = (70 ± 2)% 
and Vib = (71 ± 2)% for the associated conditional states. 

A notable feature of these results is that the interference fringes have relative phase ir for the cases of detection at 
D\ ai Du,, in agreement with Eqs. HE] We observe similar fringes if the phase r\\ between the write beams is varied 
instead of ip. Moreover, if the fields 1^, 1r are combined at the beamsplitter BSi with orthogonal polarizations (by 
way of the half- wave plate in Fig. la), we find that the visibility from interference of fields 2^,2^ drops to near zero, 
since in this case, there is no longer measurement-induced entanglement associated with quantum interference for 
detection of fields 1l, Ifl (see Appendix A). 

With Eq. 21 the measured values for the visibility V and for the various p^ are sufficient to deduce a lower bound 
for the concurrence C for the field state p~2 L ,2 R at the location of detectors -D2a,2fc,2c, with no correction for detection 
efficiencies or propagation losses, we find 

C la {p2 L .2n) = (2-4 ± 0.6) X 10- 3 > , C lb {~ P 2 L .2n) = (1-9 ± 0.6) x 10~ 3 > , (5) 

conditioned upon detection at either D\ a or Di . This conclusively demonstrates a nonzero degree of entanglement 
between the ensembles, albeit with the concurrence Cl,r small. The small difference between the concurrence for the 
states conditioned on D\ a or D\ can be explained by an asymmetry in BS W (see Appendix A). 

Beyond the firm lower bound given by Eq. we can make a better estimate of the degree of entanglement 
Cl,r between the L, R ensembles by way of detailed measurements of the propagation efficiencies from the atomic 
ensembles to the plane zq of the detectors shown in Fig. lb (see Appendix A). Figure 3 gives an inference of the 
density matrix p?f 2 and thereby of the concurrence C Zi {J>2 L 2r ) at zq and at two other locations 2j=i )2 along the 
path from the ensembles to the detectors (see Fig. lb), assuming a constant visibility. Generally, C increases in direct 
correspondence to the reduced level of losses for the 2^, 2r fields at locations closer to the ensembles. At location Z2 
corresponding to the output edges of the atomic ensembles, we find the result 

Cly > CS(p£, 2R ) =s 0.021 ± 0.006 > , Cl b R > C^p^J ~ 0.016 ± 0.006 > . (6) 

To move beyond this result, we need more detailed information about the efficiencies £,l,r with which stored excitation 
in the atomic ensembles is converted to the propagating light fields 2l,2r. Our earlier measurements included 
comparisons to a simple model j2lj and allowed an inference £,l,r ~ 0.10 ± 0.05. The measurement of the losses 
together with the values of p^ at the detectors yields pio + poi ~ H% at the output of the ensembles (22 plane) for 
our current experiment. This value together with the estimated {,l,r then indicates that poo ~~ > for the conditional 
state pl,r of the ensembles, so that Cl,r ~ V » 0.7, suggesting that pl,r is close to the ideal entangled state of 
Eq. ^ The low measured values for the entanglement between fields 2^,2^ apparently are principally a consequence 
of the low readout efficiency £l,r of the atomic excitation. We stress that this inference of C for the state inside 
the ensembles must be confirmed by subsequent experiments and is offered here to provide some insight into future 
prospects for quantum protocols with entangled ensembles. This also emphasizes that a central point in subsequent 
work should be the improvement of £,l,r- 

In conclusion, we have achieved entanglement between a pair of atomic ensembles separated by 2.8 m, with the 
entangled state involving one spin excitation within a collective system of roughly 10 5 atoms at each site L,R. The 
entangled state is generated by and conditioned upon an initial detection event, and is thus produced in a probabilistic 
fashion. However, this initial event heralds unambiguously the creation of an entangled state between L, R ensembles, 
which is physically available for subsequent utilization, as, for example, by mapping to propagating optical fields, 
which can be in principle accomplished with high efficiency. We emphasize that our measurements relate to an actual 
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FIG. 2: Coherence between the atomic ensembles L, R is induced by a measurement event of the fields 1l, 1r at detector D\ a or 
Dit,. Shown is the number of coincidences A^a (circles) and A^j, + N2 C (triangles) recorded by the respective detectors D^a fibflc 
for the fields 2l,2r with the interferometer arrangement of Fig. lb as a function of the relative phase if. In (a) N2 a ,2b,2c are 
conditioned upon a detection event at Di a with no count at Dit, while in (b) N2 a ,2b,2c are conditioned upon an event at Du 
with no count at D\ a . At each setting of data are acquired for 150 s with a detection window of width 190 ns. Although 
the interference fringes have comparable visibility, the different sizes arise from unequal quantum efficiencies for detectors 7>2a 
and D2b,2c (see Appendix A). The visibility values are obtained from an average of the visibilities of the red and black curves, 
respectively. Error bars reflect ± one standard deviation due to the finite number of counts. 



physical state of the L,R ensembles and of the 2^,2/j fields, and are not an inference of a state based upon post 
selection. Our work provides the first example of a stored atomic entangled state that can be transfered to entangled 
light fields, and significantly extends laboratory capabilities for entanglement generation, with now entangled states 
of matter stored with separation a hundred-fold larger than was heretofore possible for continuous quantum variables 
and a thousand-fold larger than for qubits. With our current setup, we have demonstrated At s ~ 1 (is for storing 
entanglement. However, this should be readily extended to At s ~ 10 /xs, and new trapping schemes have the potential 
to lead to At s ~ 1 s [T3|. The distance scale for separating the L,R ensembles is limited by the length l ~ 2 km 
for fiber optic attenuation at our write wavelength 852 nm. Extensions to scalable quantum networks over larger 
distances requires the realization of a quantum repeater 0, for which we have now laid the essential foundation. 
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FIG. 3: The results of our measurements for the concurrence C z ' (a) and density matrix p^ L 2 R O 3 ) are shown at the three 
locations z; indicated in Fig. lb. At each location, two pairs of results are given corresponding to the measurement-induced 
state created from detection at D\ a and at Dit, and taking into account the efficiency of the detectors and propagation losses, 
(a) Concurrence C, for two different detection windows Atd at D2a,2t,2c- Filled symbols are for Atd = 190 ns, enough to acquire 
the whole temporal wavepacket of field 2. Open symbols are for Atd = 120 ns. We see then that the degree of entan glem ent 
can be further enhanced, similar to the increase of nonclassical correlations between fields 1 and 2 reported in Ref. |23| for 
specific detection windows for these fields. All values shown in this figure, including the ones for zo, are already corrected for 
the efficiencies of the detectors. Error bars reflect ± 1 standard deviation, taking into account the finite number of counts and 
the uncertainties in the efficiency and propagation loss, (b) Density matrix p z ^ 2 given in the basis \n)2 L \rn)i R corresponding 
to Eq. El with {n, m} = {0, 1} and At d = 190 ns. 
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APPENDIX A: EXPERIMENTAL DETAILS 



1. Atomic ensembles and optical pulses 



Each of the L,R atomic ensembles is obtained from Cesium atoms in a magneto-optical trap (MOT) [TtI l2l| . 
Measurements are carried out in a cyclic fashion consisting first of a period of cooling and trapping to form the MOT, 
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followed by an interval during which the magnetic fields for the MOT are switched off. After waiting 3 ms for the 
magnetic field to decay [Tt| . we initiate a sequence of measurement trials, where for each trial the atoms are initially 
prepared in level \g). The write pulse is at 852 nm, with a duration of 150 ns and is detuned 10 MHz below the 
\g) — > |e) transition. The read pulse is at 894 nm, with a duration of 130 ns and is resonant with the \s) — > |e') 
transition. At the end of each trial, the sample is pumped back to level |<7) by illuminating the atomic cloud with 
trapping and repumping light for 0.7/iS and 1 [is respectively, and then a new trial is initiated with period of 3 /is. 
The total duration for a sequence of measurement trials is 5 ms, after which the measurement interval is terminated 
and a new MOT is formed in preparation for the next sequence of trials. 

2. Losses and efficiencies 

In order to infer the concurrence of the fields at different locations in our experimental apparatus it is important to 
characterize the losses in the different components of the communication channel of field 2. Our measurements for such 
losses are given in the table below for the pathways starting from each of the atomic ensembles. Immediately after the 
ensembles, we separate the classical pulses from fields 1 and 2 in the single-photon level. This procedure is explained 
in detail in Refs. 0, |2l|. The losses coming from this stage are due mainly to passage through paraffin coated 
vapor cells, which have transmission ctf c . After the filtering process, field 2 is coupled to a polarization maintaining 
fiber that carries it to the detection region. The coupling efficiency is denoted by a c . In the detection region, it is 
important first to filter the 1.06 /mi light that propagates together with field 2, and which is used to actively lock 
the read interferometer when BS2 is inserted. In order to filter the 1.06 /im light, field 2 comes out of the fiber and 
pass trough a bulk low-pass filter that reflects 1.06 /im light and transmits with high efficiency at 894 nm. At this 
stage, we also include an extra filter for 852 nm, to cut any residual light from the write process, and then field 2 
is coupled again to a fiber, this time a multi-mode one. This whole process of passing through the band-pass filters 
and recoupling to fiber is called generally "filter for 1064 nm", and is characterized by a transmission af. Finally, the 
detector efficiencies are given by olapd- Note that a single number is given for the pair of detectors T)ib,ia since they 
both have measured efficiencies of 40%. 



Description 


Symbol 


Value for ensemble L 


Value for ensemble R 


Error 


filter cell 


Q/c 


0.80 


0.80 


0.02 


fiber coupling 




0.70 


0.65 


0.02 


1064 nm filter 


a } 


0.70 


0.70 


0.02 


detector 


CtAPD 


0.32 


0.40 


0.02 


Total 


a 


0.13 


0.15 





TABLE 1: List of efficiencies associated with photon 2 propagation and detection. 



3. Suppression of interference between the 2l,2r fields for distinguishable 

detection events from the fields 1l,1.r 

In our experiment, entanglement is generated by quantum interference between the fields 1 emitted by the ensembles, 
that are combined at a beam-splitter. For this interference to occur, the two fields must be indistinguishable, such 
that no information can be obtained about the origin of the photons. A good way to illustrate the importance of 
this overlap is to render the photons distinguishable, for instance by combining the two fields 1 with orthogonal 
polarizations. In this way, information about the origin of the detected photon is near maximal, and the degree of 
measurement-induced entanglement should be significantly reduced (to zero in the ideal case). The results of such 
a measurement are shown in Fig. Fig. EIa,b) shows the interference fringes obtained when the photons of field 
1 are combined with parallel polarizations (same fringes as in Fig. while in Fig. EIc,d), the photons from field 
1 are combined with orthogonal polarizations. In the latter case, the visibility drops to near zero, and there is no 
entanglement between the two ensembles. The residual oscillation in the conditional count rate can be explained by 
the finite polarization extinction ratio in our polarization maintaining fibers. The fibers used in our experiment have 
a measured extinction ratio of 28 dB between their two orthogonal propagation modes. This can lead to a residual 
visibility of 8 %, which is compatible with the amplitude of the residual oscillation in Fig.0Jc,d). 
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4. Asymmetry in the creation of the states conditioned on D\ a and Dib 

The difference in the two sets of probabilities (f>oi , Pro** 1 ) and (f>oi > Pio^ ) results from an asymmetry in the beam 
splitter BSi for detection of the write fields II, Ir, with a measured ratio of transmission to reflection T/R = 0.85. 
Hence, in addition to the ± sign in Eq. 1 of set by detection at Di a or -Dib, the relative amplitudes for the conditional 
state can also differ, resulting in different values for the concurrence. We expect the ratio (Poi /pio )(Poi /Pio ) _1 
to be (T/R) 2 = 0.73, which agrees well with the measured value (7.51/7.38)(8.78/6.19)" 1 = 0.72. 
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FIG. 4: Number of coincidences N^ a (squares) and N2t + N% c (circles) recorded by the respective detectors D2a,ib,ic for the 
fields 2l,2r with the interferometer arrangement of Fig. 1(b) as a function of the relative phase ip. Frames (a) and (b) are 
the same as frames (a) and (b) of Fig. 2, i.e., they show the interference fringe between fields 2l,2r as a result of combining 
fields 1l, Ir in an approximately indistinguishable fashion with parallel polarizations. Frames (c) and (d) show the results of 
the same measurement on fields 2l, 2r, but now with fields 1l, Ir combined with orthogonal polarizations. At each setting of 
tp, data are acquired for 150 s with a detection window of width 190 ns. 



5. Verification of single excitation regime 

(2) 

Apart from the parameter h c discussed previously, another measure of the single-photon character of field 2 is 
given by the evaluation of the function gi2 = P12HP1P2) for each ensemble separately where p\2 is the joint 

probability of detecting a photon in field 1 and another in field 2, and pi is the probability of detecting a single photon 
in field i. For the situation of our measurement, we obtained g 12 values of about 9 and 11 for ensembles R and L, 
respectively, which is then an indication of the single-photon character of field 2 emitted by both ensembles separately, 
as discussed in [2l| . 



6. Phase stabilization. 



The w 12 m path lengths for the two interferometers formed by (BS W ,BS\) and by (BSr, BS2) shown in Figures 
la,b, respectively, are held constant by injecting an iodine stabilized Nd:YAG laser into the fiber beam splitters 
BS w ,BSr for the write and read beams. The fields at 1.064 fim emerging from beam splitters BSi, BS2 are directed 
to separate sets of detectors (not shown in the figure), whose outputs are utilized to stabilize the relative path lengths of 
the respective Mach-Zehnder interferometers by feedback to piezoelectric transducers on which are mounted mirrors in 
the paths of the write and read beams in the L arms of the respective interferometers. Since the write and read beams 
are combined together before being focused into the ensembles, the interferometers share a common free space path 
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when they cross the atoms. In order to control them independently and minimize crosstalk, the two interferometers 
are addressed alternately at a rate of 400 kHz. 

7. Normalization and detector configuration 

The role of the three detectors in our experiment is as a second check on the order of magnitude of the two-photon 
events. Our main checks were discussed in the manuscript and in section \A 51 of this Appendix. It is interesting to 
obtain directly po2 and have an idea of its effect on poi- For our measurement, we obtained, with Dib and i?2c, 
P02 = (2.2 ±0.4) x 10" 5 conditioned on detector D\ a , and P02 = (2.4 ±0.4) x 10~ 5 conditioned on detector Du- Note 
that these values are so small that if taken into account, the correction for pio and poi would be negligible. In order 
to simplify our analysis and inversion algorithm, we consider the three detectors as an effective set of two detectors. 
This is done by simply adding the coincidence events between D2b and I?2c to the sum A2& + Ni c , in the same way 
that would result from the use of a non-number-resolving detector. In this case, the measured normalization constant 
P is equal to one, since all measured events contribute to one of the elements of the restricted density matrix. If we 
had used more detectors, the discrepancy of P from unity would be on the order of P02 and negligible compared to 
the experiment accuracy. 

APPENDIX B: THEORY 
1. Entanglement 

For convenience of description we assume the two atomic ensembles L and R to be in the hands of Alice and 
Bob, respectively. The state of the two ensembles conditioned on a click of one of the two detectors D\ a or D\b (see 
Figure lb of main text) is mapped onto a state of multiple field modes belonging to Alice and multiple modes belonging 
to Bob. Because the mapping involves only local operations by Alice and Bob, the entanglement (in particular, the 
entanglement of formation) between their systems cannot increase on average [2lJ ■ Hence the entanglement found 
between Alice's and Bob's field modes is a lower bound on the entanglement between the atomic ensembles. We will 
use this type of reasoning several times here: certain experimental procedures can be exactly mimicked by imagining 
Alice and Bob performing LOCC (local operations and classical communication): those operations can only decrease 
the entanglement we find. We also sometimes (lower) bound the entanglement analytically using quantities that 
are more straightforward to measure in the laboratory. That way, we can unambiguously determine the presence of 
entanglement between the two ensembles, even if we might underestimate its actual magnitude. 

On each side there is one main mode (a traveling continuous-wave mode) into which photons are emitted predomi- 
nantly [fj. Those modes we denote by 2l and 2r. Other modes may be populated with very small probability, but in 
the analysis we assume all detector clicks arise from modes 2^ and 2r. In the experiment this reduction from multiple 
to single modes is mainly accomplished by the use of single mode fibers, which filter out different spatial modes. This 
is a procedure that can be exactly mimicked by Alice and Bob performing that same spatial filtering on their local 
modes and hence can only decrease the actual entanglement. 

We also assume that never more than 2 photons populate each mode. This is an excellent approximation on its 
own (and is supported by our measurements), but more importantly, this assumption corresponds to lower bounding 
the entanglement, as detailed below. 

We furthermore assume that all off-diagonal elements of the density matrix between states with different numbers 
of photons vanish. This is a valid assumption when one takes into account that that phase can only be defined 
relative to a reference laser beam shared by Alice and Bob. Tracing out that laser field sets the off-diagonal elements 
to zero. Indeed, the experiment makes no use of knowledge of the phases of the various lasers used. Moreover, this 
can only underestimate the entanglement, since tracing out the laser modes can be exactly mimicked by Alice and 
Bob performing local operations that makes all those off-diagonal elements zero. Namely, they could each apply a 
random phase shift to their modes, such that the phase difference is fixed (this requires classical but not quantum 
communication), and subsequently ignore the information about the individual phase shifts. The phase difference is 
equal to the phase ip introduced in the main text. 
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This then leaves us with a density matrix of the form 
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We can bound the entanglement of formation of this state by 

E( PA ,b) > PE(P2 L ,2 R ) 

where 

P = Poo+Poi +P10 +Pll 
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One obtains this bound by considering the effects of two local operations by Alice and Bob consisting of measuring 
whether each mode has more than 1 photon or not and communicating this result one to the other. We treat this 
step explicitly and separately from the very similar step mentioned above [where cases with more than two photons 
are filtered out], in order to remind ourselves we do have to keep track of the total probability to find more than 1 
photon in one of the modes, 1 — P. Also, we note explicitly this step does not correspond to any procedure in our 
experiment, but is just an analytic tool to bound the entanglement and express it in terms of quantities that can be 
easily determined without too large uncertainty (unlike higher-order matrix elements such as pi 2 , etc). 



a. Detection window 



In Fig. 3, we show results for a smaller detection window that give a higher estimation for the amount of entanglement 
between the fields. The use of a smaller detection window can also be understood as a reduction from multiple 
to single modes by filtering out different temporal modes. This is a procedure that can be exactly mimicked by 
Alice and Bob performing that same temporal filtering on their local modes and hence can only decrease the actual 
entanglement (note though, that the estimate of entanglement can be increased by this procedure, even though the 
actual entanglement decreases. The estimated value just gets much closer to the actual value than when not filtering 
out extraneous modes). 

For the possibly remaining multiple modes in the smaller time window, we may suppose Alice and Bob each apply 
the following fictitious transformation 

\ni)\n 2 ) ... — ► | y^n»)|ni,n 2 , ■ ■ -)m 

i 

to their local modes. This transformation collects all photons in one mode (the first ket), and keeps track of where they 
came from in a separate system M (for "memory"), such that the transformation is unitary. This transformation, 
therefore, leaves the entanglement between Alice's and Bob's mode unchanged. Our detectors not being able to 
distinguish different modes within the same time window then boils down to tracing out the memory system, which 
can only decrease the entanglement. 



2. Measurements 



All measurements are performed using Geiger-mode avalanche photodiodes (APDs). We assume there are only 
two outcomes of a photodetection measurement, corresponding to no click or some nonzero number of clicks (indeed, 
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that is the only information used in the experiment). Thus, if the incoming mode 2l is described by an annihilation 
operator a then the photodetector performs a POVM of the form 



n = £(-1)"- 



n>0 

iii = Ia-ti , (B4) 

with I a the identity on 2^ . The corresponding probabilities are denoted by po and p\ , 

p k = Trp 2i n fc , (B5) 

if p2 L is the state of mode 2^. To deduce higher-order probabilities with k > 2, beam splitters can be employed to 
divide and direct the mode a to multiple detectors. 

Joint probabilities for measurements on the two modes 2 l and 2r can be determined in a similar way if we introduce 
the annihilation operator b for mode 2^ and the corresponding operators H^' B for n = 0, 1 for the two modes. The 
operators Ilo and IT above were written in normal order. Similarly, joint measurement probabilities can be written 
as 

P mn = Trp2 L ,2 R : K ® II* :, (B6) 

where all annihilation operators a and b are written to the right of all creation operators a' and . For example, this 
allows one to include easily the effects of finite efficiencies of detectors: If r\ is the efficiency of a detector and a the 
annihilation operator for the mode impinging on the detector, we may replace a — > ^Jrja + ^/l — rjv where v acts on 
an auxiliary mode that is assumed to be in the vacuum state. Terms with nonzero powers of v then do not contribute 
to counting rates provided we evaluate these by using a normally-ordered expression. In that case, we can ignore v 
and just replace a — > y/rja. The same replacement can be used to take into account losses during propagation. 

The most straightforward way to determine f>2 L ,2 R consists of two stages. The first stage determines the diagonal 
elements, the second the off-diagonal elements: From the measured frequencies of joint detection events we obtain 
estimates for the corresponding probabilities for those joint events in terms of the underlying density matrix elements. 
Inverting these expressions gives estimates on the elements of the density matrix. 



a. Diagonal elements 



Conditioned upon detection of an event at either detector D\ a or D\b, the diagonal elements of p2 L ,2 H were measured 
by the setup described in Figure 1 (c) . Photons in mode 2 l are detected by a photodetector D 2a but mode 2r is split 
on a 50/50 (approximately) beamsplitter and photons in the two resulting modes are counted by photodetectors D 2 b 
and D 2c . Starting with mode operators a and b for modes 2^ and 2#, there are several transformations affecting a 
and b. Denoting by a\ the mode operator detected by detector D2 a , and by b\ and b 2 those detected by D 2 b and D 2c , 
the transformations are simply 

ax = \pr\m\ a 
h = y/r)R r l2b/V2 

h = (B7) 

with t]l,r indicating the total efficiency of propagating to the detectors, and ?/i,2.3 the detector efficiencies of detectors 
D 2a , D2b, D2c- One then obtains expressions for the expected joint count probabilities ptim with k,l,m = 0,1 by 
substituting (|BT|) into l|B4|l and l)B6|l . When acting on f>2 L ,2 R the operators a and b are understood to be a (£> Ib and 
1a ® b. 

In the end, we only consider the total number of counts in detectors D 2a and D 2 b,D 2c together, leading to joint 
probabilities Q mn with m = 0, 1 and n = 0, 1, 2, 

Qmn = Pmst- (B8) 

m s+t—n 

Again, for these measurements as well as those in the next section, the detection events at D 2a and D 2 b,D 2c are 
conditioned upon an event at either D\ a or D\b. This gives expressions for the expected detection probabilities Q mn 
as functions of the diagonal elements p mn of the density matrix: conversely, given the experimentally determined Q mn 
we invert the expressions to obtain estimates for p mn . 
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b. Off-diagonal elements 

The off-diagonal elements are measured by including two extra elements: one is a phase shifter in one of the modes, 
say mode 2^, thus replacing a — > exp(i(p)a. This phase is varied to produce the interference pattern ("fringes") of 
Figure 2. Second is an extra 50/50 beamsplitter after the phase shifter [BS 2 in Fig. 1(d)]. One again easily arrives at 
simple expressions for the operators a\ and 6i, 62 detected in terms of a and 6, similar to that of l|B7|l . Just as before, 
one then obtains expressions for the expected joint count probabilities Pkim with k,l,m — 0,1 by substituting these 
expressions for 01,61,62 into l)B4|) and 1|B6|) . We thus find the joint detection probabilities Q^n f° r the "fringes" as 
a function of d and the diagonal elements of p2 L ,2 R - Since we already obtained the diagonal elements in the previous 
step, we then find an estimate of the diagonal element d from the visibility of the fringes. 

Note that we have also carried out measurements with detector £>2a replaced by a beam splitter and a pair of 
APDs, as for detectors D2b, Die- In this way, we confirm explicitly that higher order events with k > 2 at i?2a have 
a negligible impact on our estimates of fringe visibility and of the probabilities Poo,Pio,Poi,Pii that enter into the 
determination of the concurrence C , in agreement with the independent assessment from Dib-, D% c . 

In addition to the above analysis we also performed a maximum likelihood analysis of the density matrix. Given 
the actually detected photon statistics for both measurements of diagonal and off-diagonal elements together one can, 
for each possible candidate density matrix p2 L ,2 R , calculate the probability that the actually obtained measurement 
outcomes occur. Maximizing that probability over all possible density matrices gives the most likely density matrix. 
This estimate has been used as an additional check on the inferred values quoted in the main text and their errors. 
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